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0 Word organised data processors. 



0 In a data processing system configured to transfer data in words and an apparatus for manipulating, in 
response to a single manipulation instruction, the contiguous variable length sequence of data stored in bytes 
including an Instnjction decoder for decoding a single manipulation instruction, a memory for storing data in 
words where each but word includes a plurality of bytes, a circuit responsive to the command signal from the 
Instruction decoder to access a contiguous sequence of the bytes from the memory wherein the first byte of the 
sequence is not stored on a word boundary, and providing the byte sequence to a plurality of data processing 
registers wherein the first byte of the sequence is aligned with the first registered boundary. Additionally the 
apparatus includes the capability to provide the contiguous sequence to memory wherein the first byte of the 
sequence is not stored on a word boundary. The data sequence maybe specified by the number of bytes In the 
sequence or by a sequence terminating character. 
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The present invention relates to a data processing system and more specifically to an apparatus for 
manipulating cliaracter strings in a data processing system. 

Data processing systems access and manipulate data A unifomi amount of data is used for transferring 
and manipulating by the processor. This uniform amount is commonly the amount of data that is stored as 
s a single unit in a memory. In modem data processing systems, this storage unit of data Is termed a word 
and consist of 16 binary digits or bits. Words may also consist of 32. 48 or 64 bits depending on the 
characteristic of the specific data processing system. It Is also desirable to break up vwrds Into smaller 
units such as bytes (8 bits), or nibbles (4 bits). 

As data processing systems increase in capability, it is more advantageous to transfer large amounts of 
to data as single units. However, it is also desirable to be able to access small amounts of data, such as bytes 
or nibbles, in data processing operations. The storage of data units such as words in memory requires the 
word be the smallest unit of data access. In other words, when one accesses data in memory, one must 
access a word at a time. 

Characters strings are specific character data stored In a contiguous sequence. Commonly a single 
« character is less 'than a word In length, usually a byte. Since the length of character strings may vary 
storing these variable length character strings only on word boundaries would result in the inefficient use of 
memory storage, i.e.. the number of bytes in a specific character sequence are not always equal to a 
number of complete words of memory used. Therefore, it is desirable to access character sequences in 
memory that are not stored on word boundaries. 

Since previous data processing systems have used 8 bit, 16 bit or 32 bit word lengths for storage and 
transfer units, the problem of efficient storage of character strings has not been of great Interest However 
more modem data processing systems are increasing, the number of bits in a storage or transfer unit' 
Therefore, the accessibility of bytes within data words Is becoming more important. 

The following prior art addresses character sequence processing. U. S. Patent No. 4,556.951 entitled 
Central Processor With Instnictions For Processing Sequences Of Characters", discusses a data process- 
ing system that has been adapted for processing character sequences and more specifically for examining 
character sequences for a predetermined characteristic. 

U. S. Patent No. 4.625.295 entitled, "Textual Comparison System For Locating Desired Character 
Stnngs And Delimiter Characters", discloses a data processing system including a text comparison circuit 
30 that IS used to implement different search strategies for detecting predefined words in a text sequence 

The Advanced Micro Devices publlcafion, Am29000 User's Manual , on page 8-36. lists a compare byte 
instruction which compares bytes of one register to corresponding'bytes of another register 

IBM Technical Disclosure Bulletin . Volume 23. No. 5. October. 1980. pages 2051-2056 entitled "Scan 
Instruction" discloses an instruction for scanning through a character string from left to right in a byte by 
36 byte fashion. ' 

IBM Technical Disclosure Bulletin , Volume 20. No. 8, January, 1978. entitled "Variable Ungth Byte 
Stnng Compare and Move Instruction" discloses an instruction for scanning a command string for a 
delimiter or for the first non-delimiter character. 

The object of the present invention is to provide an apparatus for loading and storing character 
40 sequences consisting of data units smaller than the unit of data storage or data processing. 

The present invention provides a word organised data processor having a natural data flow between 
memory and registers In transfer units of words aligned in both memory and registers on word boundaries 
wherein, to enable handling of variable length sequences of infbmiation items, each item individually shorter 
than a word - e.g., character strings -. apparatus is inserted between the memory and the registers 
46 responsive to invoked individual instructions, to transfer data therebetween, word boundary aligned at the 
. registers and variably aligned at memory. 

There is disclosed hereinafter, in a data processing system configured to transfer data In word units, an 
apparatus is provided for manipulating. In response to a single data manipulation instruction, a contiguous 
variable length sequence of data stored in units smaller than a word unit. The apparatus includes an 
50 instruction decoding circuit for decoding instrucUons and providing a command signal in response to the 
decoding of the single manipulation Instruction. A memory circuit is provided for storing data as addres- 
sable words wherein each word includes a plurality of smaller data units. Additional circuitry is provided for 
accessing a contiguous sequence of the smaller data units in the memory circuit in response to a command 
signal from the instruction decoding circuit. The first of the smaller data units of the unit sequence stored in 
the data memory is not stored on a word boundary in the memory circuit. A circuit for providing the smaller 
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data unit sequence to a plurality of data processing registers is also included. This circuit provides that the 
first smaller data unit in the sequence be aligned with the first registered boundary. 

In the preferred embodiment disclosed, circuitry performs the function of fetching character strings from 
byte locations in memory regardless of whether the bytes are stored on word boundaries. This circuitry 
5 includes a byte rotator circuit that arranges the locations of bytes within a single 32 bit word from memory. 
The byte rotator together with a byte merge circuit are used to merge bytes together into a sequence for 
storage in the general purpose registers in the data processing system. By providing the ability to rotate 
and merge bytes, it becomes possible to access bytes that are not stored on word boundaries in memory 
and provide them on the appropriate boundaries, i.e., register boundaries, to the data processing system, 
10 These bytes in sequence, make up character strings which can be manipulated as required in the general 
purpose registers. A second byte rotator together with a second byte merge capability enables one to store 
the sequence in memory on non-word boundaries. 

Thus the augmented embodiment provides the optional capability to access and manipulate bytes 
inside of words in a machine that naturally handles words, promoting the efficient storage of character 
16 strings In memory by not requiring that the character strings be stored on word boundaries while still 
enabling individual bytes of the character strings sequence to be manipulated as required by the application 
running in the data processing system. 

The present invention will be described further by way of example with reference to the aforesaid 
embodiment thereof as illustrated in the accompanying drawings in which: 
20 Figure 1 1s a block diagram illustrating the data flow of the embodiment; 

Figures 2A through 2D illustrate the data flow for a load string instruction; 
Figures 3A through 3D illustrate the data flow for a store string instruction; 
Figures 4A through 40 illustrates the data flow for a load string and compare byte instruction; 
Figures 5A through 5D illustrate the data flow for a store string Instruction for a null terminated 
2S character string; 

Rgure 6 is a flow diagram for the load string instruction; 
Figure 7 is a flow diagram for the storage string instruction; 
Figure 8 is a flow diagram for the load string and compare byte instruction; 
Figure 9A is an illustration of the load string and store string operational code format; 
30 Figure 9B is an example of a character string transfer operation where the character string, is 

specified by byte count; 

Rgure 9C is an example of instructions for performing a word by word compare operation; and 
Rgure 9D is an example of a byte comparison operation using the load string and compare byte 
construction. 

35 The disclosed embodiment adds to a word organised processor an apparatus for fetching and storing 
character strings. Character strings, as used in the following discussion, include a sequence of characters 
specified by byte data. The length of the byte sequences can either be specified with a byte count or by a 
termination character at the end of a sequence. 

Rgure 1 illustrates in block diagram form the data flow for a character string operation. Referring to 

40 Rgure 1. the Instruction decode unit 10 decodes the string operation instruction and provides the command 
signal over lines 12 to control logic 14. Control logic 14 includes a fixed point count register 16 (XPCR). 
Control logic 14 provides the control signals to operate the different components of the invention such as 
the byte rotators 22 and 52. registers 26 and 56, byte merge logic 30 and 60 and the byte comparator 34. 
The control lines from the control logic 14 to these Individual units has been omitted for the sake of 

45 simplicity. However, the operation of the control logic 14 with these individual components will be explained 
In flowchart form. 

In Figure r, a byte rotator 22 is shown connected to a register 26 and byte merge logic 30, Byte rotator 
22 is connected by bus 18 to a memory 20. In the following discussion, the memory 20 and the term cache 
will be used interchangeably since this arrangement can be used with both a data cache and a system 

50 memory. The memory 20 is addressable by words over the bus 18, The bus is assumed 32 bits wide 
making the word length 4 bytes. A byte comparator 34 is connected to the register 26 to compare the 
individual bytes In the register 26 to a predetermine byte. Register 26 is connected to the byte rotator 22 by 
a bus 24 and also to the byte merge logic 30 by bus 28. Additionally the byte merge logic 30 Is connected 
to receive zeros. The byte rotator 22. register 26, and byte merge logic 30 are one word in length, i.e., 32 

55 bits. 

The byte merge logic 30 is connected to the general purpose file 44. These are the general purpose 
registers for the central processing unit. CPU. The CPU is shown to have access to the general purpose 
registers file 44 by a bus 48. Each register In the general purpose register file 44 is 32 bits or one word in 
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length. The general purpose file 44 Is connected by a bus 50 to a second byte rotator 52. The second byte 
rotator is also connected to a the register 56 by a bus 54 and byte merge logic 60 by the same bus 54. The 
register 56 and byte merge logic 60 are connected by a bus 58. The output of the byte merge logic 60 Is 
connected by bus 62 back to memory 20. As before, the capacity of the byte rotator 52, register 56. and 

5 byte merge 60 is one word in length. 

The control logic 14 functions In the following manner. If the XPCR contains zero, then no operation is 
performed, otherwise, the following is computed by the control logic 14: 
LSBS » 2 Least Significant bits of Starting Address 
LSBX = 2 Least Significant bits of (XPCR1 - 1) 

10 Note: All Mask and Mark fields are shown in descending order from Most Significant Byte to Least 
Significant Byte. 

The control logic 14 compute the number of accesses required by the string instructions using the 
following formula: 

Number of accesses required = 1 + (XPCR-1)/4 + (LSBS + LSBX)/4 
IS where V ' indicates integer division. 

TABLE 1 



Load String 


LSBS 


# of Bytes to 


Load Byte 




Rotate Left 


Merge Mask 


00 


0 


0000 


01 


1 


0001 


10 


2 


0011 


11 


3 


0111 



30 

TABLE 2 



36 



Load String 


LSBX 


Last Access 




Mask Bits 


00 


1000 


01 


1100 


10 


1110 


11 


1111 



The Load Byte Merge Mask controls the Byte Merge 30. A '1 * indicates that this byte from memory 20 
4S should replace this byte from Register 28. A *0 ' Indicates that the byte from Register 26 should be used as 
is. The control logic 14 generates the Load Byte Merge Mask as shown in Table 1, 

The Last Access Mask bits are used to control the Byte Merge 30 for the last access. A '1 * indicates 
that this byte should be passed through as controlled by the Load Byte Merge Mask. A '0 ' indicates that a 
value of binary '00000000' should be passed for this byte. The control logic 14 generates the Last Access 
50 Mask bits as shown In Table 2. 
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TABLE 3 



Store String 


LSBS 


# of Bytes to 


Store Byte 




Rotate Right 


Merge Mask 


00 


0 


0000 


01 


1 


1000 


10 


2 


1100 


11 


3 


1110 



TABLE 4 



2d 



Store Strings With Only One 




Access 




# of bytes 


LSBS 


Memory 


in XPCR 




Mark Bits 


1 


00 


1000 


1 


01 


0100 


1 


10 


0010 


1 


11 


0001 


2 


00 


1100 


2 


01 


0110 . 


2 


10 


0011 


3 


00 


1110 


3 


01 


0111 


4 


00 


1111 



TABLE 5 



45 



Store Strings With More Than One Access 


LSBS 


First Mark 


LSBS + LSBX 


Last Mark 




Bits 




Bits 


00 


1111 


00 


1000 


01 


0111 


01 


1100 


10 


0011 


10 


1110 


11 


0001 


11 


1111 



The Store Byte Merge Mask controls the Byte Merge 60, A '1 ' indicates that this byte from register 56 
so should replace this byte from the General Purpose Register File 44. A '0 ' indicates that the byte from the 
General Purpose Register File 44 should be used as is. The control logic 14 generates the Store Byte 
Merge Mask as shown in Table 3. 

Mark bits are sent to the Memory 20 with every store access. A '1 ' indicates that this particular byte 
should be modified In the memory 20. A *0 ' indicates that this byte should remain unchanged in the 
55 Memory 20. For Store String Instructions which access only one word, only one set of mark bits are 
generated by the control logic 14 as shown in Table 4. For Store String instructions which access more 
than one word, the control logic 14 must compute a set of mark bits for both the first and last accesses as 
shown in Table 5. For ali other accesses between the first and last accesses, a set of mark bits containing 
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air 1 ' s is used. 

To understand the operation hereof, it is best to view an example of a string transfer. In Figure 2A, the 
contents of memory is illustrated as word 1 and word 2 in memory 20A and word 3 and word 4 In memory 
20B. Word i and word 2 contain the string "ABCDE". In this example, the string "ABCDE" will be 

5 transferred from the word 1 and word 2 location to the word 3 and word 4 location. Presently the word 3 
and word 4 contain a second character string "VWXYZ". 

When the load string instruction Is first received, the address of word 1 is computed and the contents of 
word 1 is fetched. This is illustrated in Rgure 2B by the contents of the memory bus in 18A. Note that the 
byte count has been stored In the XPCR register 16A. The contents of the word 1 are received and they are 

70 rotated left 2 bytes as illustrated by the byte rotate logic 22A. This is stored in register 26A. The byte 
merge logic 30A and the general purpose register 44A are unchanged. 

In Rgure 2C the address of word 2 is computed and then word 2 is fetched as illustrated by the 
memory bus 18B. The contents are then rotated left 2 bytes by the byte rotate logic 226. A byte merge 
operation is then performed with the left 2 bytes of register 268 and the right 2 bytes of the byte rotate 

15 logic 228 which are output from the byte merge logic 30C as shown. The contents of the byte merge logic 
In 30C are then stored in the general purpose registers 44C as shown. The byte rotate logic 22B contents 
are stored into register 26C. Since the byte count Indicates that all of the string has been fetched, the final 
step is to merge the remaining contents of register 26D with the remaining string. Therefore, the left most 
byte in register 260 is merged in the byte merge logic 30E with trailing zeros and the contents then stored 

20 in the general purpose register file 44E as shown. Note that the general purpose register file 44E includes 2 
sequential registers for storing the sequence. 

The next operation is to store the contents of the general purpose registers to memory in words 3 and 
words 4 (See Figure 3A). In Rgure 3B, the count remains at 5 in the XPCR register 16H. Since the first 
word in memory is to contain only 3 bytes, the contents of the general purpose register containing the first 

25 4 bytes is rotated right one byte and the memory Is told to only update bytes 1, 2 and 3 (see bytes 
indicated for the memory bus 62H). Bus 62 (Figure 1) includes lines that indicate to the memory 20 which 
bytes are to be updated. 

In Rgure 3C. the next word 4 is addressed and the remaining portions of the data string is merged in 
the byte merge logic 60J and provided onto the memory bus 62J shown modifying the bytes as indicated. 
30 the memory 20L illustrates the final transfer of the string (See Figure 3D). 

In the preceding example the character string was specified by a count. In the following example, the 
character string is specified in length by a termination character at the end of the character sequence. 
However, count register 16 must be loaded with a count larger than the actual string itself. The load string 
and compare byte instruction is then executed where the byte that is being compared is the terminal 
35 character. In Rgure 4A. the memory 20 contains the string to be transferred consisting of the character 
string "ABCDE" and the termination character "0". This character string will be transferred from words 1 
and 2 in memory 20M to words 3 and 4 in memory 20N. 

In Figure 4B. the fixed point count register 16M is loaded with the number 8 (which is larger than the 6 
bytes to be transferred). The address for word 1 is then computed and the contents of word 1 are accessed 
40 on memory bus 18M. This contents are rotated two bytes to the left in byte rotator logic 22M and stored in 
register 26M as shown. 

In Rgure 4C, word 2 is fetched from memory as shown in memory bus 18N and again is rotated 2 
bytes to the left as illustrated in the byte rotate logic 22N. The left 2 bytes of register 26N are merged with 
the right 2 bytes of the byte rotator logic 22N in the byte merge logic 30P. The byte merge logic contents 

45 SOP are stored in the general purpose register 44P. Register 26P now contains the previous contents of the 
byte rotate logic 22N. The last step merges the contents of register 26Q (left 2 bytes) with the right 2 bytes 
of the byte rotator logic 22Q in the byte merge logic 30R. The count in the fixed point count register 16Q 
has been updated since a positive comparison to the termination character was made. The contents of the 
byte merge logic 30R are then stored into the general purpose registers the general purpose registers as 

50 shown in 44R. 

Again, Rgures 5A through 5D illustrate the storage of the general purpose register containing the string, 
terminated by the null character, to the memory location 20N. In Rgure 5B, the contents of the general 
purpose register 44S containing the first 4 bytes is rotated 1 byte to the right. This contents is then placed 
in register 568 to the byte merge logic 60S. The 3 bytes indicated on the memory bus 628 are updated. 
55 In Rgure 5C, the contents of register 56T are then merged with the remaining sequence in the byte 
rotate logic 52T to form the contents of the byte merge logic 60U which is placed on the memory bus 62T 
to update the bytes as indicated. This results in the transfer of the character string to the memory location 
20W in Rgure 5D. 
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Figures 6 Illustrates a flowchart for the load string Instruction. The load string instruction is first decoded 
In Step 100. In Step 102. the starting byte address is computed. This includes accessing the cache or 
memory word that contains that byte address. In Step 104, determination is made as to whether the string 
has been terminated. If not, in Step 106 the starting byte Is left justified and stored in the register 26, the 

5 address is Incremented and the next word is fetched. In Step 110, again it is determined whether the end of 
the string has been encountered. If not, in Step 112. the byte rotate operation is performed and the newly 
fetched bytes are merged with the previously fetched bytes in the register 26. The merge contents is then 
stored to the general purpose register 44. The rotated contents are then saved in the register 26. The 
general purpose register pointer 46 is Incremented, the address is then incremented and the next word 

10 fetched. Returning to Step 108. the end of the string has been accessed, the starting byte is left justified 
and the last bytes merged with trailing zeros. The contents are then stored in the general purpose register. 
In Step 117, the next instruction is decoded. Returning to Step 114. again when the end of string has been 
accessed, the contents of the memory bus are rotated and merged with trailing zeroes and the contents in 
the merge logic are stored in the general purpose register. The process then proceeds to the next 

IS Instruction in Step 116. 

Figure 7 illustrates the store string instruction. The store string instruction is decoded In Step 120. In 
Step 122. the starting byte address is then computed. From this it is determined the bytes that will be 
stored in the modified memory word! These data is fetched from the general purpose registers 44 and 
rotated to align the bytes In their appropriate positions. These bytes are sent to memory with an Indication 

20 to memory to modify the specific bytes. The rotated data Is saved in the register 56. In Step 124. it is 
determined whether the end of the string is being encountered. If not. in Step 126, the address for the next 
word Is incremented, the general purpose pointer is incremented, and the number of bytes to be updated Is 
then computed. Data is fetched from the general purpose registers 44, rotated and merged and provided to 
memory. The rotated contents is then saved in the register 56. When the end of the string is encountered. 

25 the process proceeds to Step 128 to decode the next instruction. 

Figure 8 illustrates the load string and compare byte instruction. In Step 130, load string and compare 
byte instruction is decoded. In Step 132. the starting address of the byte is computed and the word 
containing this byte is accessed. In Step 134. it Is determined whether the end of the string has been 
encountered. If not, in Step 136, it is determined if any of the bytes are equivalent to the comparison byte. 

30 If not, in Step 138, the starting byte Is left justified and then saved. The address Is Incremented and the 
next word is accessed. The byte rotate and merge function are performed and the contents are stored In 
the general purpose registers 44. The rotated contents are then saved in register 26, In Step 146, it is again 
determined whether the end of the string has been accessed. If not. In Step 148. It is determined if any of 
the bytes are equivalent to the comparison byte. If not, in Step 150. the general purpose register pointer 48 

35 is incremented, the byte address is incremented and the next word is accessed from memory. The byte 
rotate and merge operations are performed with contents stored in the general purpose registers 44. The 
rotated contents are stored register 26. Returning to Step 140, the starting byte Is left justified and stored in 
the the general purpose registers 44, In Step 142. It is determined if any of the bytes are equivalent to the 
comparison byte. If so, the process computes the number of the first matching byte and then updates the 

40 byte count and flag accordingly. The process proceeds to the Step 159. to decode the next instructions. 

Returning to Step 152. the general purpose register pointer 46 is Incremented and the byte rotate and 
merge operations are performed with contents stored In the general purpose registers 44. In Step 154, it Is 
determined whether any of the bytes are equivalent to the comparison byte. If so. In Step 156. the number 
of the first matching byte is computed, the byte count changed and flag set In Step 158. the next 

45 instruction Is decoded. 

Figure 9A Illustrates the format for the load string and store string instruction. These string instnjctions 
can be used to perform rapid string moves by being coded in a small loop. The address of the starting byte 
of the string is computed by adding general purpose register RA + RB. For load string "Is", the string is 
loaded starting with register RT. then RT + 1. and so on until the end of the string. For the store string 

50 Instruction "ss", the string is stored starting with register RT, then RT+1, and so on until the end of the 
string. The length of the string Is contained in the register called the Fixed Point Control Register (XPCR). 
For example, if the XPCR is loaded with the count of 32 bytes, a counter can be established that represents 
the number of 32 byte blocks which will be moved and be placed in register Rent. The starting address of 
the source string is In Rs and the starting address of the destination string is in register Rd. Eight working 

55 registers are required for 32 bytes (Rw-Rw + 7). Register Rx contains the current index value starting at 
zero. The loop for moving the string Is Illustrated in Rgure 9B. 

Because the Is (load string) instruction pads the registers with zeroes past the end of the string, this 
Instruction can be used to perform string comparisons In which the comparison occurs a word at a time. If 
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Rsl and Rs2 are the starting addresses of the 2 strings and Rv is another set of working registers, the loop 
illustrated in Figure 9C will perform this function. 

For null terminated strings (as in the language C), the Load String and Compare Bytes "Iscb" 
instructions can be used. The byte to be compared with is in a different field within the XPCR. Since Iscb 
5 instructions update the byte content field in the XPCR to the number of the matching bytes, the store string 
instruction which follows it will only store up to and including the null character. The register usage is as 
described above except that Rent is not needed since the length of the string Is determined by the location 
of the null character. Assuming the XCPR Is loaded with the null character and the byte count is 32. the 
loop to accomplish the function is Illustrated in Figure 90. 

10 

Claims 

1. A word organised data processor having a natural data flow between memory and registers in 
76 transfer units of words aligned in both memory and registers on word boundaries, wherein, to enable 
handling of variable length sequences of information items, each item individually shorter than a word - e.g., 
character strings apparatus is inserted between the memory and the registers, responsive to invoked 
individual instructions, to transfer data therebetween, word boundary aligned at the registers and variably 
aligned at memory. 

20 2. A processor as claimed in claim 1 , wherein the inserted apparatus includes 

instruction decoding means for decoding instructions and providing a command signal in response to a 
single manipulation instruction; 

means in response to a command signal from the Instruction decoding means for accessing the contiguous 

variable length sequence of information items from the memory, the first information item of the sequence 
25 being stored in the memory without restriction, i.e., potentially not on a word boundary; and 

means for providing the sequence to a plurality of data processing registers, the first information item of the 

sequence being presented aligned on the first register boundary. 

3. A processor as claimed in claim 2. wherein the apparatus further includes means for altering the 

positions of the infonmatlon items in a word component of the sequence. 
30 4. A processor as claimed in any preceding claim wherein each Information item is the same integral 

sub*multipie of the processor word size - i.e., a byte. 

5. A processor as claimed In claim 4, wherein the inserted apparatus includes means for comparing 
bytes in the sequence with a predefined byte. 

6. A processor as claimed in claim 6. wherein the inserted apparatus further includes means for writing 
35 the byte sequence from the registers to the memory by updating individual byte locations in the memory 

means. 

7. A processor as claimed in claim 6 wherein the writing means for storing the byte sequence in 
memory includes means for determining a predetermined byte in the registers to be the trailing byte in the 
last word of the register stored byte sequence. 

40 8. A processor as claimed in claim 7. wherein the byte sequence includes either a predefined byte 
character specifying sequence termination or a count specifying the number of bytes in the sequence. 

9. A processor as claimed in claim 8, wherein the instruction specifies a first register of a sequence of 
registers and the apparatus includes means for storing the byte sequence in the specified first register and 
sufficient of the remaining registers in the natural processor nominated sequence thereof. 

46 
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(load string instruction DECODEd)/100 



COMPUTE STARTING BYTE ADDRESS. 

ACCESS CACHE WORD CONTAINING THAT ADDRESS. 




102 



108 



LEFT JUSTIFY STARTING BYTE. 
SAVE IN REGISTER 26. INCRE- 
MENT ADDRESS AND ACCESS 
NE)CT CACHE WORD. 



LEFT JUSTIFY ACCESS START- 
ING BYTE AND MERGE WITH 
TRAILING ZEROS. STORE INTO 
GENERAL REGiSTER{S). 




BYTE ROTATE ACCESS AND 
MERGE WITH REGISTER 26. 
STORE INTO GENERAL REGIS- 
TER(S). SAVE ROTATED AC- 
CESS IN REGISTER 26. INCRE- 
MENT GENERAL REGISTER 
POINTER. INCREMENT ADD- 
RESS AND ACCESS NEXT 
CACHE WORD. 



BYTE ROTATE ACCESS AND 
MERGE WITH REGISTER 26 AND 
TRAILING ZEROS. STORE INTO 
GENERAL REGISTER(S). 



( DECODE NEXT INSTRUCTION ) 

c 

^116 



FIG. 6 
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STORE STRING INSTRUCTION DECODiD) r^120 



COMPUTE STARTING BYTE ADDRESS. 
COMPUTE BYTES WHICH WILL BE MOD- 
IFIED IN THE CACHE WORD. FETCH 
DATA FROM GENERAL REGISTER(S) 
BYTE ROTATE DATA TO AUGN BYTES 
AND SEND TO CACHE. SAVE ROTATED 
DATA IN REGISTER 56 



.122 




INCREMENT ADDRESS TO NEXT WORD. 
INCREMENT GENERAL REGISTER POINT- 
ER. COMPUTE BYTES WHICH WILL BE. 
MODIRED IN THE CACHE WORD. FETCH 
DATA FROM GENERAL REGISTER(S) 
BYTE ROTATE DATA, MERGE WITH 
REGISTER 56, AND SEND TO CACHE. 
SAVE ROTATED DATA IN REGISTER 56 



c 



128 



DECODE NEXT INSTRUCTION 



RG. 7 
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( LOAD STRING AND COMPARE BYTES INSTRUCTION DECODED ^ 

I — 



COMPUTE STARTING BYTE ADDRESS. ACCESS 
CACHE WORD CONTAINING THAT ADDRESS. 



END 
•OF STRING 



-132 



130 



140 

L 



LEFT JUSTIFY STARTING 
BYTE. STORE INTO 
GENERAL REGISTER(S) 




BYTES=COM-v^NO 



LEFT JUSTIFY STARTING BYTE. 
SAVE IN REGISTER 26. INCRE- 
MENT ADDRESS AND ACCESS 
NEXT CACHE WORD. BYTE 
ROTATE ACCESS AND MERGE 
WITH REGISTER 26. STORE 
INTO GENERAL REGISTER (S). 
SAVE ROTATED ACCESS IN 
REGISTER 26. 



5; 



146 



COMPUTE NUMBER FOR MATCHING BYTE. 
UPDATE BYTE COU NT AND SET FUG. 

>-^= 

(DECODE NEXT INSTRUCTION) 

1 

159^ 

152 

\ 



NO 



END 

OF STRINGS YES 




148 

ANY 
.BYTESoCOM- 
PARISON 
3YTE7 

NO 



INCREMENT GENERAL REG- 
ISTER POINTER. STORE IN- 
TO GENERAL REGISTER(S). 
BYTE ROTATE ACCESS AND 
MERGE WITH REGISTER 26 



YES 



150 



ANY 
,BYTES=COM-s 
PARISON 
3YTE?^ 



INCREMENT GENERAL REGISTER 
POINTER. INCREMENT ADDRESS 
AND ACCESS NEXT CACHE WORD. 
BYTE ROTATE ACCESS AND MERGE 
WITH REGISTER 26. STORE INTO 
GENERAL REGISTER (S). SAVE 
ROTATED ACCESS IN REGISTER 26. 



COMPUTE NUMBER FOR MATCHING 
BYTE. UPDATE BYTE COUNT AND 
SET FLAG. 



158 



(^DECODE NEXT INSTRUCtIon) 



FIG. 8 
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1s rt, ra, rb 
ss rt, ra, rb 



Opcode 


RT 


RA 


RB 



FIG. 9A 



Loop: 



Is 


Rw, Rs, Rx 


/* Load 32 bytes from source */ 


ss 


Rw, Rd, Rx 


/* Store 32 bytes to destination 7 


add 


Rx, Rx + 32 


/• Increment Index 7 


sub 


Rent, Rent -1 


/• Decrement Count 7 


brnz 


Loop 


r Branch if Rent not zero 7 



FIG. 9B 



Loop: 



Is 


Rv, Rs1,Rx 


r Load 32 bytes from source 1 7 


Is 


Rw, Rs2, Rx 


/• Load 32 bytes from source 2 7 


add 


Rx, Rx + 16 


r Increment Index 7 


cmp 


Ry, Rw 


/* Compare 1st Words 7 


brne 


Miscompare 


r Branch if not equal 7 


cmp 


Rv + 1, Rw + 1 


r Compare 2nd Words 7 


brne 


Miscompare 


r Branch if not equal 7 


cmp 


Rv + 2, Rw + 2 


/* Compare 3rd Words 7 


brne 


Miscompare 


r Branch if not equal 7 


cmp 


Rv + 3, Rw + 3 


r Compare 4th Words 7 


brne 


Miscompare 


/* Branch if not equal 7 
/• Decrement Count 7 


sub 


Rent, Rent -1 


brnz 


Loop 


r Branch if Rent not zero 7 



FIG. 9C 



Loop: 1 scb Rw, Rs, Rx r Load up to 32 bytes from source 7 

ss Rw, Rd, Rx /* Store same bytes to destination 7 

add Rx, Rx + 32 /* Increment Index 7 

brne Loop 1* Branch if null not found 7 



FIG. 9D 



